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Background: Coronaviruses (CoVs) primarily cause enzootic infections in birds and mammals but, in the last few 
decades, have shown to be capable of infecting humans as well. The outbreak of severe acute respiratory syndrome 
(SARS) in 2003 and, more recently, Middle-East respiratory syndrome (MERS) has demonstrated the lethality of CoVs 
when they cross the species barrier and infect humans. A renewed interest in coronaviral research has led to the 
discovery of several novel human CoVs and since then much progress has been made in understanding the CoV life 
cycle. The CoV envelope (E) protein is a small, integral membrane protein involved in several aspects of the virus’ life 
cycle, such as assembly, budding, envelope formation, and pathogenesis. Recent studies have expanded on its 
structural motifs and topology, its functions as an ion-channelling viroporin, and its interactions with both other CoV 


proteins and host cell proteins. 


Main body: This review aims to establish the current knowledge on CoV E by highlighting the recent progress that 
has been made and comparing it to previous knowledge. It also compares E to other viral proteins of a similar nature 
to speculate the relevance of these new findings. Good progress has been made but much still remains unknown and 
this review has identified some gaps in the current knowledge and made suggestions for consideration in future 


research. 


Conclusions: The most progress has been made on SARS-CoV E, highlighting specific structural requirements for its 
functions in the CoV life cycle as well as mechanisms behind its pathogenesis. Data shows that E is involved in critical 
aspects of the viral life cycle and that CoVs lacking E make promising vaccine candidates. The high mortality rate of 
certain CoVs, along with their ease of transmission, underpins the need for more research into CoV molecular biology 
which can aid in the production of effective anti-coronaviral agents for both human CoVs and enzootic CoVs. 
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Background 

Coronaviruses (CoVs) (order Nidovirales, family Corona- 
viridae, subfamily Coronavirinae) are enveloped viruses 
with a positive sense, single-stranded RNA genome. 
With genome sizes ranging from 26 to 32 kilobases (kb) 
in length, CoVs have the largest genomes for RNA vi- 
ruses. Based on genetic and antigenic criteria, CoVs have 
been organised into three groups: a-CoVs, B-CoVs, and 
y-CoVs (Table 1) [1, 2]. Coronaviruses primarily infect 
birds and mammals, causing a variety of lethal diseases 
that particularly impact the farming industry [3, 4]. They 
can also infect humans and cause disease to varying de- 
grees, from upper respiratory tract infections (URTIs) 
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resembling the common cold, to lower respiratory tract 
infections (LRTIs) such as bronchitis, pneumonia, and 
even severe acute respiratory syndrome (SARS) [5-14]. 
In recent years, it has become increasingly evident that 
human CoVs (HCoVs) are implicated in both URTIs and 
LRTIs, validating the importance of coronaviral research 
as agents of severe respiratory illnesses [7, 9, 15-17]. 
Some CoVs were originally found as enzootic infections, 
limited only to their natural animal hosts, but have crossed 
the animal-human species barrier and progressed to estab- 
lish zoonotic diseases in humans [19-23]. Accordingly, these 
cross-species barrier jumps allowed CoVs like the SARS- 
CoV and Middle Eastern respiratory syndrome (MERS)- 
CoV to manifest as virulent human viruses. The consequent 
outbreak of SARS in 2003 led to a near pandemic with 8096 
cases and 774 deaths reported worldwide, resulting in a 
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Table 1 Organisation of CoV species (adapted from Jimenez- 
Guardeno, Nieto-Torres [18]) 


Group 
a-CoVs 


Species 


Transmissible gastroenteritis coronavirus (TGEV) 
Canine coronavirus (CCoV) 

Porcine respiratory coronavirus (PRCoV) 

Feline coronavirus (FeCoV) 

Porcine epidemic diarrhoea coronavirus (PEDV) 
Human coronavirus 229E (HCoV-229E) 

Human coronavirus NL63 (HCoV-NL63) 

B-CoVs Bat coronavirus (BCoV) 

Porcine hemagglutinating encephalomyelitis virus (HEV) 
Murine hepatitis virus (MHV) 

Human coronavirus 4408 (HCoV-4408) 

Human coronavirus OC43 (HCoV-OC43) 

Human coronavirus HKU1 (HCoV-HKU1) 

Severe acute respiratory syndrome coronavirus (SARS-CoV) 
Middle Eastern respiratory syndrome coronavirus (MERS-CoV) 
y-CoVs Avian infectious bronchitis virus (IBV) 


Turkey coronavirus (TCoV) 


fatality rate of 9.6% [24]. Since the outbreak of MERS in 
April 2012 up until October 2018, 2229 laboratory- 
confirmed cases have been reported globally, including 791 
associated deaths with a case-fatality rate of 35.5% [25]. 
Clearly, the seriousness of these infections and the lack of ef- 
fective, licensed treatments for CoV infections underpin the 
need for a more detailed and comprehensive understanding 
of coronaviral molecular biology, with a specific focus on 
both their structural proteins as well as their accessory pro- 
teins [26-30]. Live, attenuated vaccines and fusion inhibitors 
have proven promising, but both also require an intimate 
knowledge of CoV molecular biology [29, 31-36]. 

The coronaviral genome encodes four major structural 
proteins: the spike (S) protein, nucleocapsid (N) protein, 
membrane (M) protein, and the envelope (E) protein, all 
of which are required to produce a structurally complete 
viral particle [29, 37, 38]. More recently, however, it has 
become clear that some CoVs do not require the full 
ensemble of structural proteins to form a complete, in- 
fectious virion, suggesting that some structural proteins 
might be dispensable or that these CoVs might encode 
additional proteins with overlapping compensatory func- 
tions [35, 37, 39-42]. Individually, each protein primarily 
plays a role in the structure of the virus particle, but they 
are also involved in other aspects of the replication 
cycle. The S protein mediates attachment of the virus to 
the host cell surface receptors and subsequent fusion be- 
tween the viral and host cell membranes to facilitate 
viral entry into the host cell [42—44]. In some CoVs, the 
expression of S at the cell membrane can also mediate 
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cell-cell fusion between infected and adjacent, unin- 
fected cells. This formation of giant, multinucleated 
cells, or syncytia, has been proposed as a strategy to 
allow direct spreading of the virus between cells, sub- 
verting virus-neutralising antibodies [45-47]. 

Unlike the other major structural proteins, N is the 
only protein that functions primarily to bind to the CoV 
RNA genome, making up the nucleocapsid [48]. Al- 
though N is largely involved in processes relating to the 
viral genome, it is also involved in other aspects of the 
CoV replication cycle and the host cellular response to 
viral infection [49]. Interestingly, localisation of N to the 
endoplasmic reticulum (ER)-Golgi region has proposed a 
function for it in assembly and budding [50, 51]. How- 
ever, transient expression of N was shown to substan- 
tially increase the production of virus-like particles 
(VLPs) in some CoVs, suggesting that it might not be re- 
quired for envelope formation, but for complete virion 
formation instead [41, 42, 52, 53]. 

The M protein is the most abundant structural protein 
and defines the shape of the viral envelope [54]. It is also 
regarded as the central organiser of CoV assembly, inter- 
acting with all other major coronaviral structural pro- 
teins [29]. Homotypic interactions between the M 
proteins are the major driving force behind virion enve- 
lope formation but, alone, is not sufficient for virion for- 
mation [54—56]. Interaction of S with M is necessary for 
retention of S in the ER-Golgi intermediate compart- 
ment (ERGIC)/Golgi complex and its incorporation into 
new virions, but dispensable for the assembly process 
[37, 45, 57]. Binding of M to N stabilises the nucleocap- 
sid (N protein-RNA complex), as well as the internal 
core of virions, and, ultimately, promotes completion of 
viral assembly [45, 58, 59]. Together, M and E make up 
the viral envelope and their interaction is sufficient for 
the production and release of VLPs [37, 60-64]. 

The E protein is the smallest of the major structural 
proteins, but also the most enigmatic. During the replica- 
tion cycle, E is abundantly expressed inside the infected 
cell, but only a small portion is incorporated into the vir- 
ion envelope [65]. The majority of the protein is localised 
at the site of intracellular trafficking, viz. the ER, Golgi, 
and ERGIC, where it participates in CoV assembly and 
budding [66]. Recombinant CoVs have lacking E exhibit 
significantly reduced viral titres, crippled viral maturation, 
or yield propagation incompetent progeny, demonstrating 
the importance of E in virus production and maturation 
[35, 39, 40, 67, 68]. 


Main text 

The envelope protein 

Structure 

The CoV E protein is a short, integral membrane protein 
of 76-109 amino acids, ranging from 8.4 to 12 kDa in 
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size [69-71]. The primary and secondary structure re- 
veals that E has a short, hydrophilic amino terminus 
consisting of 7-12 amino acids, followed by a large 
hydrophobic transmembrane domain (TMD) of 25 
amino acids, and ends with a long, hydrophilic carboxyl 
terminus, which comprises the majority of the protein 
(Fig. 1) [1, 60, 72-75]. The hydrophobic region of the 
TMD contains at least one predicted amphipathic a- 
helix that oligomerizes to form an ion-conductive pore 
in membranes [76—78]. 

Comparative and phylogenetic analysis of SARS-CoV E 
revealed that a substantial portion of the TMD consists of 
the two nonpolar, neutral amino acids, valine and leucine, 
lending a strong hydrophobicity to the E protein [79]. The 
peptide exhibits an overall net charge of zero, the middle 
region being uncharged and flanked on one side by the 
negatively charged amino (N)-terminus, and, on the other 
side, the carboxy (C)-terminus of variable charge. The 
C-terminus also exhibits some hydrophobicity but less than 
the TMD due to the presence of a cluster of basic, posi- 
tively charged amino acids [80]. Computational predictions 
regarding the secondary structure of E suggest that the C- 
terminus of B- and y-CoVs also contains a conserved pro- 
line residue centred in a B-coil-6B motif [72]. This motif 
likely functions as a Golgi-complex targeting signal as mu- 
tation of this conserved proline was sufficient to disrupt 
the localization of a mutant chimeric protein to the Golgi 
complex and instead localized the protein to the plasma 
membrane [81]. 

The SARS-CoV E protein has recently been found to 
contain a binding motif known as the postsynaptic density 
protein 95 (PSD95)/Drosophila disc large tumour suppres- 
sor (Dlg1)/zonula occludens-1 protein (zo-1) (PDZ)-bind- 
ing motif (PBM), located in the last four amino acids of the 
C terminus [82]. The PDZ domain is a protein-protein 
interaction module that can bind to the C-terminus of tar- 
get proteins such as the cellular adapter proteins involved 
in host-cell processes important for viral infection [83-86]. 
Some interaction partners capable of binding to the PBM 
of SARS-CoV E have been identified and appears to be 
involved in the pathogenesis of SARS-CoV [18, 66, 82, 87]. 

The importance of the PBM domain was recently dem- 
onstrated in SARS-CoV-infected cells [88]. The PBM do- 
main was either mutated or deleted but reverted to a 
pathogenic state after several passages in Vero E6 host 
cells. Deletion of either the last nine resides of SARS-CoV 
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E (APBM) or mutation of the four PBM residues to gly- 
cine (mutPBM) resulted in the acquisition of a PBM at the 
C-terminus of E that was similar to the original PBM se- 
quence. Deleting the last 12 residues of E (A6), including 
the PBM, caused viruses to acquire an alternative PBM 
different from the sequence of the original PBM. Of par- 
ticular interest is the mutation of only two of the PBM 
residues to alanine (altPBM) as these mutants maintained 
the same mutated sequence after serial passage of infected 
cells. This suggests that, at least for SARS-CoV E, some 
minor PBM mutations appear to be tolerated but that a 
reasonably intact PBM domain is still necessary to avoid 
revertant mutants [34, 88]. It would be interesting to see if 
any of these serially passaged PBM mutants are still cap- 
able of host cell protein interaction and whether the mu- 
tations allow the virus to retain its pathogenicity in both 
in vivo and in vitro systems. This would prove valuable for 
the design of a live, attenuated vaccine with a PBM suffi- 
ciently mutated to remain intact, but also enough to be 
non-functional and abolish the pathogenicity of the virus. 


Localisation 

Coronaviruses are distinct from other well-studied 
enveloped viruses in that they bud into the ERGIC, from 
where they acquire their membrane envelope [89]. Once 
in the lumen of the ERGIC, infectious virions make their 
way through the host secretory pathway to, ultimately, 
be released from the infected cell [90]. Accordingly, the 
E protein is localized mainly to the ER and Golgi- 
complex where it participates in the assembly, budding, 
and intracellular trafficking of infectious virions [56, 66, 
71, 91]. Concern has been raised over the possibility of 
epitope-tagged E proteins affecting its localisation, but 
both FLAG-tagged and untagged versions of SARS-CoV 
E demonstrate this distribution pattern [73, 81, 92]. 
Nieto-Torres, DeDiego [66] also investigated the subcel- 
lular localization of the SARS-CoV E protein using both 
transfected cells and infected cells and found that in 
both groups of cells E accumulated at the ER-Golgi, sug- 
gesting that the presence of the tag on E did not affect 
its localization. The authors also reported that the other 
viral structural proteins did not appear to significantly 
influence the localization of the E protein, concluding 
that localization of SARS-CoV E occurs at the ERGIC, 
whether expressed alone or during an infection. Al- 
though studies investigating the localisation of E have 
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Fig. 1 Amino Acid Sequence and Domains of the SARS-CoV E Protein. The SARS-CoV E protein consists of three domains, i.e. the amino (N)- 
terminal domain, the transmembrane domain (TMD), and the carboxy (C)-terminal domain. Amino acid properties are indicated: hydrophobic 


(red), hydrophilic (blue), polar, charged (asterisks) [78] 
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only used FLAG-tagged versions of the protein, the 
results suggest that epitope tags do not appear to have 
any significant influence on the localisation of the CoV 
E protein to the ER-Golgi region. However, there is no 
evidence to support whether the presence of larger 
epitope-tags, such as glutathione S-transferase (GST) 
and green-fluorescent protein (GFP), might interfere 
with CoV E protein localisation. 

Establishing which part of the E protein contains the 
information responsible for targeting to the ERGIC is 
important as it might allude to how CoVs interact with 
both other viral proteins and host proteins to facilitate 
the assembly of new infectious viral progeny. However, 
research into this aspect has been sparse. Using SignalP, 
Wu, Zhang [79] reported a predicted signal peptide 
cleavage site at the N-terminus of the SARS-CoV E 
protein. However, Raamsman, Locker [71] reported no 
difference in the electrophoretic mobility of the mouse 
hepatitis virus (MHV) A59 E protein during or after its 
membrane integration and concluded that MHV E has 
no cleavable signal peptide sequence. Corse and Macha- 
mer [93] were the first to identify that the C-terminus of 
the IBV E protein housed the Golgi-targeting informa- 
tion. They explored the possibility of a targeting signal 
located in the luminal N-terminus but found the trun- 
cated terminus to be transported to the cell surface. 
Conversely, truncation of the C-terminus and produc- 
tion of a chimeric E protein both demonstrated reten- 
tion at the Golgi complex, leading the authors to 
conclude that the Golgi-targeting information of the IBV 
E protein was located in its C-terminus. Further trunca- 
tion of the C-terminus narrowed down the bulk of the 
targeting information to a sequence motif located 
between amino acid residues 44 and 72. 

Building on this, Cohen, Lin [81] found that the Golgi 
complex-targeting information of the SARS-CoV E protein 
was also located in the C-terminus. The authors specifically 
demonstrated that neither the mutation of a highly con- 
served proline residue nor the disruption of the predicted 
6-strands, that stabilise the 6-hairpin on either side of the 
conserved proline residue, were sufficient to disrupt the 
targeting of the SARS-CoV E protein to the Golgi complex. 
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Using an N-terminus chimeric protein, the authors went 
on to investigate the possibility of Golgi-targeting informa- 
tion present in the E protein N-terminus. Interestingly, the 
N-terminus chimaera was targeted to the Golgi region and 
the authors concluded that the N-terminus of the SARS- 
CoV E protein contains additional targeting information. 
They further remarked that the existence of targeting in- 
formation in both the N- and C-terminus likely explains 
why the localization of full-length E proteins with muta- 
tions only in the C-terminus was not disrupted. From these 
studies, it is evident that Golgi-targeting information is lo- 
cated primarily in the C-terminus of CoV E, but it appears 
that for some CoVs, like SARS-CoV E, additional targeting 
information could be found in the N-terminus. 


Topology 

A variety of different E protein topologies have been de- 
scribed and proposed for the different CoVs. Some studies 
have used prediction programs with conflicting predictions 
between the programs and some in conflict with the experi- 
mental evidence (Table 2). Infection and transient transfec- 
tion experiments have shown that the C-terminus of the 
IBV E is located cytoplasmically while its N-terminus is 
located in the lumen of the Golgi complex [60]. The C- 
terminus of MHV E is also located cytoplasmically, but no 
N-terminus was detected. Based on the hydropathy plot of 
the protein, the authors suggested that it might be buried 
inside the lipid bilayer [71]. The C-terminus was confirmed 
to be in the cytoplasm and that the highly hydrophobic N- 
terminus causes it to be buried within the Golgi membrane 
[94]. Conversely, the TGEV E protein exhibits a topology of 
a luminal C-terminus and a cytoplasmic N-terminus [95]. 
To date, however, the topology of the SARS-CoV E protein 
has received the most attention. A FLAG-tagged SARS- 
CoV E protein, Yuan, Liao [91] was reported to assume an 
N- and C-terminus cytoplasmic topology. Prediction soft- 
ware demonstrated conflicting predictions between both 
the software and the experimental evidence; TMHMM and 
MEMSAT predicted a cytoplasmic N-terminus and a 
luminal C-terminus, while HMMTop predicted a luminal 
N-terminus and a cytoplasmic C-terminus. Moreover, 
transfected and infected cells expressing untagged SARS- 


Table 2 Prediction programs showing membrane topologies of four different CoV E proteins with predicted locations of N- and C- 
termini, and TMDs. Prediction programs used: TM Pred, HMMTop, TMHMM 2.0, MEMSAT3, and MEMSAT-SVM [96-100]. Taken from 


Ruch and Machamer [41] 


Prediction IBVE MHV E SARS E TGEV E 
Program 
N ‘@ TMDs N é TMDs N C TMDs N c TMDs 

TM Pred lumen lumen 2 lumen cyto lumen cyto 1 lumen cyto 1 
HMMTop lumen lumen 2 cyto cyto 2 lumen cyto 1 cyto cyto 2 
TMHMM 2.0 lumen lumen 2 lumen cyto cyto lumen 1 lumen cyto 1 
MEMSAT-SVM lumen lumen Z lumen lumen Z lumen lumen 2 cyto lumen 1 
MEMSAT3 cyto cyto 2 lumen cyto lumen lumen 2 lumen cyto 1 
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CoV E exhibited a luminal N-terminus and a cytoplasmic 
C-terminus topology [66]. Given the variety of different 
topologies, the number of TMDs for the CoV E protein 
have also been inconclusive. 

The prediction programs in Table 2 likely conflict in 
their predicted outcomes based on the algorithm used by 
each program and/or the window size that was used to 
calculate the result. The design of algorithms used in pre- 
diction programs requires an array of aspects to be taken 
into consideration, largely those involved in machine 
learning, which makes identifying the exact reason(s) for 
the difference in predictions between programs challen- 
ging [101]. Nevertheless, the main reason likely stems 
from differences in the features unique to each algorithm, 
such as, whether the algorithm would include multiple 
features of the target protein(s) or only a clearly defined 
set of criteria; how accurately the algorithm should 
discriminate between the different features; the point at 
which the specificity or sensitivity for a certain feature is 
defined as too broad or too narrow [102]. The calculations 
used to design the algorithm along with its cut-off values 
should also be taken into consideration, all of which only 
speak to one aspect of machine learning. Nevertheless, 
some proteins prove challenging to isolate and not all bio- 
chemical techniques offer the needed high-resolution 
structural detail, in which case prediction programs are a 
good alternative and offer valuable insight into the pre- 
dicted outcomes [101]. 

Many prediction programs also make use of a sliding 
window method to predict certain structural features of 
a protein. It is based on a window size that covers de- 
fined fragments of the amino acid sequence, rather than 
the whole sequence and takes into account that a given 
characteristic of an amino acid is not only determined 
by the residue itself, but also by the adjacent residues 
[103]. This method is widely used in the prediction of 
hydrophobicity, flexibility and rigidity, secondary struc- 
ture, and tertiary structure of proteins [104—108]. It is 
possible that a standard window size, corresponding to a 
stretch of residues in the sequence, was not used be- 
tween the prediction programs, or even between differ- 
ent CoVs, which might have resulted in the different 
topological predictions for each of the CoVs in Table 2. 
Based on a probabilistic approach, the prediction of 
structural features, such as coils and strands, would 
benefit from smaller window sizes as residues up to 
three and six positions away from the central residue, re- 
spectively, can affect the formation of these structures. 
Conversely, helical structure formation can be affected 
by up to nine residues away from the central residue 
and would benefit from a larger window size [103]. Ac- 
cordingly, the use of a standardised, optimal window size 
could prove beneficial to obtain a more consistent and 
accurate topological prediction for CoV E. 
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The experimental evidence described in the previous 
section strongly suggests that the presence of an epitope 
tag does not interfere with the localisation of the CoV E 
protein. However, the use of epitope tags has been criti- 
cized for its interference with the properties or features 
of the tagged protein [41, 66]. By tagging the N- 
terminus of the IBV E protein with a FLAG tag, Ruch 
and Machamer [109] succeeded in producing a mem- 
brane hairpin conformation, with the N- and C-termini 
oriented cytoplasmically. However, the untagged E pro- 
tein exhibited the topological conformation of a single 
transmembrane-spanning protein, demonstrating that 
the topology may be altered by the presence of the N- 
terminal tag [66]. Other reports proposing the hairpin 
conformation have also made use of N-terminal epitope- 
tagged CoV E proteins [91, 109]. 

The rationale for the multiple membrane topologies has 
been suggested in that, between the different CoV species, 
the E protein might not exhibit a uniform membrane top- 
ology or that the orientation of E varies depending on the 
level of protein expression or oligomerization [69]. Alter- 
natively, the function of the E protein might dictate its 
membrane topology, depending on whether it is required 
to function as an ion channel or its involvement in the 
viral envelope during assembly [41]. 


Post-translational modifications 

Palmitoylation 

Palmitoylation functions in the subcellular trafficking of 
proteins between membrane compartments and can also 
modulate protein-protein interactions (PPIs) [110, 111]. 
Palmitoylated proteins have an increased hydrophobicity, 
which has been reported to assist in membrane associ- 
ation and also functions in membrane anchoring [112, 
113]. Palmitoylated viral proteins are well-represented in 
enveloped viruses, including the haemagglutinin (HA) 
protein of the influenza virus, Env of retroviruses and 
filoviruses, and F13 L of the vaccinia virus [114]. In the 
vaccinia virus, palmitoylation of its F13 L protein has 
been shown to be essential for targeting to the appropri- 
ate membranes [115]. The hepatitis C virus (HCV) 
nucleocapsid core protein binds to ER membranes in a 
palmitoylation-dependent manner for the formation of 
viral particles [116]. 

Of the CoV E proteins, only IBV, SARS-CoV, and 
MHV have been found to be palmitoylated [73, 93, 117]. 
A number of integral membrane proteins are substrates 
for palmitoylation where the cysteine residues adjacent 
to the TMDs serve as the targets [118, 119]. Double or 
triple mutation of the cysteine residues on the MHV- 
A59 E protein to alanine significantly reduces VLP for- 
mation [52, 117]. Furthermore, triple-mutated E proteins 
are unstable, prone to degradation, and significantly re- 
duces the viral yield of the corresponding recombinant 
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MHV, suggesting that palmitoylation of E plays an es- 
sential part in the viral assembly of MHV [117]. Palmi- 
toylation of IBV E does not affect its localization to the 
Golgi region, as cysteine-mutated E proteins are indis- 
tinguishable from their palmitoylated counterparts [93]. 
Interestingly, mutation of certain hydrophobic residues 
in the TMD along with all three cysteine residues of 
SARS-CoV E protein disrupted targeting to the Golgi 
[73]. Although the authors did not demonstrate the 
localization pattern of the triple-mutated E protein on 
its own, the results suggest that palmitoylation alone of 
the SARS-CoV E protein does not affect its localization. 
Rather, it is possible that a loss of both the Golgi- 
targeting information in the TMD and the palmitoylated 
cysteine residues leads to the loss of localization as well 
as membrane its association [65]. Lopez, Riffle [117] 
suggested that palmitoylation of the E protein might 
affect how it interacts with the membrane. The position 
of the palmitoylated cysteine residues in relation to the 
hydrophobic TMD likely increases the region’s affinity 
for the membrane, serving to alter or stabilise associ- 
ation between the protein and the membrane. 


Myristoylation 

Linkage of myristic acid (C14:0) to the N-terminal of a gly- 
cine residue found on some viral, cellular, or bacterial pro- 
teins, is known as N-terminal myristoylation [120-123]. 
Several viral proteins are myristoylated including the polio- 
virus VP4 protein, simian immunodeficiency virus (SIV) 
Gag protein, human immunodeficiency virus (HIV) nega- 
tive regulatory factor (Nef) protein, and the pre-S1 protein 
of the hepatitis B virus (HBV) [124-127]. All of these pro- 
teins contain the conserved sequence motif ,MGxxxS/T, 
where ‘x’ can be any amino acid [80]. Coronavirus E pro- 
teins, along with other members of the Nidovirales order, 
reportedly have no myristoylation motif and it is suggested 
to be a feature unique only to the Arteriviridae family in 
the order of Nidovirales [80]. However, there appears to be 
no experimental evidence to support this. 


Ubiquitination 

Ubiquitination and its counterpart, deubiquitination, are 
well-characterised post-translational modifications with 
that serve to maintain homeostasis through the regulation 
of cellular protein levels and their functions [128]. Viruses 
can exploit this component of the host cell machinery, or 
even encode their own ubiquitinating/deubiquitinating en- 
zymes to drive the viral life cycle [129]. Only SARS-CoV E 
has so far been reported to be ubiquitinated, although the 
relevance has not yet been determined. The SARS-CoV 
non-structural protein (nsp) 3 co-localises with E and its 
interaction was mediated through the N-terminal 
ubiquitin-like domain-1 of nsp3. Independently, a ubiqui- 
tination assay further demonstrated that E can be 
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ubiquitinated and that its ubiquitination status inversely 
correlates to its stability and half-life [128, 130]. Moreover, 
given the late expression of SARS-CoV accessory protein 
8b, Keng, Akerstrém [130] suggested that it might func- 
tion to modulate viral production by down-regulating E 
production and in doing so maintain an optimal viral titre. 
However, this will need to be confirmed in the context of 
a natural infection. 


Glycosylation 

In N-linked glycosylation, oligosaccharide moieties are 
attached to specific asparagine residues located in the 
consensus sequence Asn-X-Ser/Thr. It aids in the proper 
folding and trafficking of cellular and viral proteins by 
actively recruiting host chaperone proteins such as cal- 
nexin and calreticulin [131]. Very little information is 
available on the glycosylation of CoV E and its role. The 
IBV E protein has been suggested to contain a single gly- 
cosylation site in its luminal N-terminus, while SARS- 
CoV E has been predicted to contain two potential gly- 
cosylation sites [132]. Based on the topology of IBV E, 
Corse and Machamer [60] proposed that it could be gly- 
cosylated on asparagine residue five (N5) of the N- 
terminus. However, this was found not to be the case, 
likely due to the proximity of the residue to the mem- 
brane [133]. Likewise, residue N48 in SARS-CoV E was 
also shown not to be glycosylated and proposed to be 
non-functional for the same reason [73]. Conversely, 
residue N66 was shown to be glycosylated and, more 
interestingly, mutation of this residue generated higher 
molecular weight forms resembling dimers and trimers 
of the E protein. This suggests that glycosylation of N66 
might function to prevent oligomerization of the E pro- 
tein, possibly to promote a specific role of the E protein. 
Accordingly, multimeric forms of the E protein may not 
be glycosylated on N66 possibly to promote the func- 
tioning of E in other capacities [134]. Westerbeck and 
Machamer [90] used both infected and transfected cells 
and reported the presence of two different forms of the 
IBV E protein, each associated with a specific function. 
They proposed that the lower molecular weight, possibly 
monomeric form, functions in disruption of the host 
secretory pathway, while the higher molecular weight 
oligomeric form is required for virion assembly. Clearly, 
more research is needed to determine whether all CoV 
E proteins are glycosylated, or whether it is unique to 
SARS-CoV that might confer to it certain pathogenic 
features, and what the importance of E protein glyco- 
sylation is. 


Protein-protein interactions: Intraviral 

Membrane and envelope proteins 

Co-localization of and interaction between M and E is 
probably the most well-established and characterised of 
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PPIs between the CoV structural proteins [56, 60, 61, 
117]. Co-expression of M and E is sufficient for VLP for- 
mation and release [37, 60—64]. The interaction is medi- 
ated by the C-termini of both proteins and occurs on 
the cytoplasmic side of the ERGIC [56, 61, 89]. The 
importance of these domains is evident by the drastic 
reduction of VLPs upon deletion of the domains [56]. 


Envelope and envelope proteins 

The CoV E protein is unique in that it can form homo- 
typic interactions, which allows it to oligomerise and 
generate an ion-channel protein known as a viroporin 
[135, 136]. Biochemical evidence suggests that the ability 
of CoV E to form homo-oligomeric multimers is 
dependent on its TMD. Synthetic peptides that corres- 
pond to the SARS-CoV E TMD can form dimers, 
trimers, and pentamers, demonstrating the importance 
of the TMD in CoV E homotypic interactions [137]. This 
was ability to produce multimeric homo-oligomers was 
confirmed by expression of SARS-CoV E in Sf9 insect 
cells. Substituting certain hydrophobic residues in the 
TMD with charged residues, significantly alters the elec- 
trophoretic migration rate of E to the extent that only 
monomers are observed [73]. To date, not many studies 
have investigated which TMD residues are required for 
CoV E homotypic interactions. Mutation of the TMD 
residues asparagine 15 (N15) to alanine (N15A) and val- 
ine 25 (V25) to phenylalanine (V25F) have been found 
to abolish the ion channelling capability of CoV E viro- 
porin, a structure dependent on its homopentameric 
conformation [75, 76, 138]. Interestingly, mutation of 
N15A and V25F, respectively, appear to hamper the 
oligomerisation of CoV E, at least to some extent. The 
appearance of monomers in response to V25F clearly 
suggests that this residues plays a more crucial role in 
oligomerisation, as opposed to N15A, which appears to 
reduce the amount of pentamers only slightly [139]. The 
ability of CoV E to assemble into homopentameric 
structures is clearly important in the formation of a 
functional CoV E viroporin [75, 76, 135-138, 140]. 


Nucleocapsid and envelope proteins 

It has been shown that M and E are sufficient to drive 
VLP formation in many CoVs and that its production is 
further enhanced by the co-expression of N [42, 60, 63, 
64, 141]. It is thought that E and N interact with M inde- 
pendently and are assembled separately into VLPs. Ac- 
cordingly, it is not known whether E and N interact and, 
in doing so, if this interaction is what could enhance virion 
production. Only two studies have reported a possible 
interaction between E and N, one for murine MHV and 
the other for SARS-CoV. Tseng, Wang [142] reported an 
interaction between SARS-CoV E and N mediated largely 
through the C-terminus of both proteins. Deletion of the 
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last C-terminal residue of E markedly reduced E and N 
interaction although it did not seem to significantly com- 
promise efficient VLP production. Although the study 
only looked at an E-N interaction in transfected cells, it is 
interesting to note that Maeda, Maeda [143] already found 
coimmunoprecipitation of structural proteins E and N in 
MHV-infected cells. This suggests that there might, in 
fact, be a physical interaction between E and N but the 
reason and exact requirements for this interaction remains 
to be determined. More research is needed to understand 
this interaction and whether it offers a possible explan- 
ation as to why or how VLP production is enhanced dur- 
ing the co-expression of M, E, and N [42, 52]. 


Spike and envelope proteins 

A sub-regional analysis of both E and S revealed a triple 
cysteine motif located directly after the E protein TMD 
(NH,- ... L-Cys-A-Y-Cys-Cys-N ... -COOH) and a simi- 
lar motif located in the C-terminus of S (NH>2- ... S-Cys- 
G-S-Cys-Cys-K ... -COOH) [79]. The authors proposed 
that the predicted orientation, position, and composition 
of these two motifs could serve as a structural basis for 
the association between E and S, which would be medi- 
ated by the formation of disulphide bonds between the 
corresponding cysteine residues (Fig. 2). Although this is 
yet to be proven experimentally, it would be interesting 
to see whether this interaction is indeed possible. Such 
evidence could also provide some insight into the 
debated topological conformations of the E protein and 
could confirm whether multiple topologies are possible 
to accommodate this interaction. 

Experimental data on a physical interaction between 
CoV S and E is extremely limited with the exception of 
one study, which showed that SARS-CoV S is an inter- 
acting partner of E [128]. Using a tagged E protein, the 
study aimed to identify SARS-CoV E protein interacting 
partners by a tandem affinity purification (TAP) system 
coupled with mass spectrometry (MS; TAP-MS). Al- 
though S was shown to co-purify with E, the authors did 
not pursue the mechanism or importance of this inter- 
action. This finding clearly warrants further investigation 
into an intraviral protein interaction which has not been 
investigated yet. 

Protein 7a, a structural protein unique to SARS-CoV, 
is incorporated into mature virions and plays an import- 
ant part in the pathogenesis of SARS-CoV, where it 
functions to induce apoptosis, arrest the cell cycle, and 
promote the production of pro-inflammatory cytokines 
[144-148]. In a mammalian two hybrid system, SARS- 
CoV E was found to interact with 7a, but the importance 
of this interaction has not yet been determined [149]. 
However, despite this interaction with E, 7a still appears 
to be dispensable for SARS-CoV replication both in vivo 
and in vitro [30, 150-152]. 
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Fig. 2 Predicted interaction between SARS-CoV E and S proteins through disulphide bonds [79] 


Protein-protein interactions: Host-viral 
Viruses lack the necessary machinery to self-replicate and 
are, therefore, dependent on the host cell machinery for 
propagation. Numerous viruses exploit the host cell’s repli- 
cation machinery to establish an infection by way of host- 
viral PPIs [83]. The anti-apoptotic protein B-cell 
lymphoma-extra-large (Bcl-xL) protein was the first host 
protein reported to interact with SARS-CoV E protein, al- 
luding to the possibility that the coronaviral E protein is 
also capable of host-viral PPI [87]. The domain mediating 
this PPI was only identified later when the SARS-CoV E 
protein was shown to interact with the protein associated 
with Caenorhabditis elegans lin-7 protein 1 (PALS1) [82]. It 
is now established that PALS1 bound to SARS-CoV E 
through its PDZ domain. The PDZ domain is a protein- 
protein recognition sequence found in cellular adaptor pro- 
teins that coordinate host cell signalling pathways by bind- 
ing to other proteins that have a complementary PBM. A 
number of these signalling pathways and processes are 
exploited by viruses for replication, propagation, and patho- 
genesis [153-157]. The PBM of SARS-CoV E is found in 
the last four amino acids (DLLV) of its C-terminus [1, 82]. 
To date, E has only been reported to interact with five 
host proteins, i.e. Bcl-xL, PALS1, syntenin, sodium/potas- 
sium (Na‘/K*) ATPase a-1 subunit, and stomatin [18, 66, 
82, 87]. Some context has been offered as to the relevance 
of each interaction, but it is not yet fully understood. Yang, 
Xiong [87] proposed that the interaction between E and 
Bcl-xL contributed to the SARS-CoV-induced lymphopenia 
observed in most SARS patients. Teoh, Siu [82] reported 
that the E-PALS1 interaction disrupts tight junctions in 
the lungs, suggesting a mechanism whereby SARS-CoV 
virions can breach the alveolar wall and develop into a 


systemic infection. Nieto-Torres, DeDiego [66] suggested 
that the interaction of E with Na"/K* ATPase a-1 subunit 
and stomatin, 2 proteins involved in maintaining ionic 
homeostasis, could be responsible for the reduced levels 
and activity of human epithelial sodium channels. Jimenez- 
Guardeno, Nieto-Torres [18] is the only group to have 
shown that E is a determinant of SARS-CoV pathogenesis 
in vivo. By infecting mice with recombinant SARS-CoV vi- 
ruses, they demonstrated that E caused syntenin to be 
redistributed to the cytoplasm where it triggered an overex- 
pression of inflammatory cytokines. This would give rise to 
an exacerbated immune response, resulting in tissue dam- 
age, oedema, and culminate in the characteristic acute re- 
spiratory distress syndrome (ARDS). 

Interestingly, each of the E protein interactions was 
only reported in SARS-CoV. A closer look at the pre- 
dicted PBM motif for each of the coronaviral genera a, 
6, and y reveals that the PBM motif appears to be con- 
served only among the a and 6 CoVs and is not found 
in the y CoVs (Fig. 3) [18]. As no experimental evidence 
yet speaks to any such interactions for the other a and 6 
CoVs, it remains to be seen whether the reported inter- 
action partners uniquely interact with SARS-CoV E, or 
if they can also interact with E from other coronaviral 
species from the same genus. Aside from this, it is of 
therapeutic importance that more E interaction partners 
be identified as inhibitors of p38 mitogen-activated pro- 
tein kinase (MAPK) were shown to increase the survival 
rate of mice, protecting them from a lethal infection [18, 
158]. Identifying more interaction partners for CoV E 
could provide a more targeted therapeutic approach 
where licensed coronaviral treatments are currently inef- 
fective [26-28]. 
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Fig. 3 Partial amino acid sequences of the E protein C-terminus for the different CoV genera. Red blocks represent the potential location of the 


predicted PBM motif [18] 


Functions of the envelope protein 

Despite its enigmatic nature, research conducted to date 
has been able to propose three roles for the CoV E protein. 
The interaction between the cytoplasmic tails of the M and 
E proteins drives VLP production, suggesting that E partici- 
pates in (1) viral assembly [56, 61, 89]. The hydrophobic 
TMD of E is also crucial to the (2) release of virions [40, 
53, 159]. Lastly, SARS-CoV E is implicated in the (3) patho- 
genesis of the virus [18, 82, 87]. The progress made in these 
three aspects of E will be reviewed accordingly. 


Assembly and budding: Membrane curvature 

Coronaviruses are unique among enveloped viruses in that 
assembly of the viral envelope occurs at the ERGIC. From 
there, virions bud into the lumen, navigate their way 
through the host secretory pathway, and ultimately egress 
from the cell [89, 90, 160, 161]. Although assembly of the 
viral envelope is coordinated by M, both M and E are re- 
quired for the production and release of VLPs [51, 55, 56, 
60-64, 141, 162-164]. Still, deleting the E gene from sev- 
eral recombinant CoVs does not halt virus production but 
rather cripples viral production severely or produces 
replication-competent but propagation-defective virions 
[35, 39, 40, 67, 68, 150, 165, 166]. Clearly then E is 


involved in the CoV assembly and release, but the exact 
role is not yet fully understood. 

The coronaviral envelope consists predominantly of M 
while only a small portion of E is incorporated into the 
viral envelope of virions [100, 167, 168]. Extensive elec- 
tron microscopy (EM) studies conducted on M from a 
variety of CoVs provided no indication that M is capable 
of inducing membrane curvature on its own [51, 169, 
170]. In fact, various recombinant CoVs (rCoVs) lacking 
the E gene (AE) exhibit a strikingly aberrant morph- 
ology. When C-terminus residues of MHV E were mu- 
tated to alanine, virions became temperature sensitive 
and took on pinched, elongated shapes rather than the 
typical spherical particles observed among wild type vi- 
rions [171]. Plaques of recombinant MHV-AE exhibited 
a very similar aberrant morphology, presenting as small, 
irregular-shaped plaques with jagged edges [39]. Cells in- 
fected with recombinant SARS-CoV-AE (rSARS-CoV- 
AE) contained a lower number of mature virions but 
exhibited a higher proportion of vesicles containing a 
dense, granular material. This material was proposed to 
be the result of the aborted viral assembly process that 
gave rise to immature virions [35]. Most interestingly, 
TGEV-AE-infected cells contained immature virions that 
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were blocked from being secreted into the medium. The 
absence of TGEV E arrested virus trafficking and, thereby, 
blocking full virion maturation [40]. In comparison, the 
phenotype of VLPs made up of M and E are described as 
smooth and indistinguishable from, or resembling, wild 
type virions, placing this morphology in stark contrast to 
that observed of virions lacking E [37, 63, 64]. Clearly, even 
though viral assembly and production is not completely 
stopped in the absence of E, the aberrant morphology of 
AE-virions strongly suggests that E participates in the as- 
sembly process. Most likely then, instead of coordinating 
viral assembly, the function of E is rather to induce mem- 
brane curvature of the viral envelope, thereby allowing 
CoV particles to acquire their characteristic spherical shape 
and morphology. 

Coronavirus-infected cells contain several different 
membranous structures, including double-membrane 
vesicles (DMVs) and convoluted membranes (CMs) 
[172-175]. However, little is known about exactly 
how these structures are formed and which viral and/ 
or host proteins are involved in this process. Co- 
expression of SARS-CoV nsps 3, 4, and 6 can induce 
membrane rearrangements that resemble the DMVs 
and CMs observed in CoV-infected cells [176]. The 
luminal loops present in full-length nsp3 and nsp4 
are essential for the formation of the replicative struc- 
tures seen in SARS-CoV-infected cells [176, 177]. 
Moreover, the cysteine residues located in the luminal 
loop nsp4 appear to be particularly important in the 
process of ER membrane rearrangement [177]. Hage- 
meijer, Monastyrska [177] proposed a model in which 
the luminal loops located between the transmembrane 
regions of nsp3 and 4 interact with one another to 
initiate the rearrangement of ER membranes and 
induce membrane curvature to form DMVs (Fig. 4). 
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This underpins the importance of establishing a 
unanimous topology for the E protein as this model 
could be applied to the induction of membrane curva- 
ture by E, provided E can assume multiple topologies 
during an infection. Should it be demonstrated that E 
can take on a topology with a luminal loop, this would 
not be inconceivable as a possible mechanism for the 
induction of membrane curvature initiated by E or in 
which E participates. Equally, as heterotypic interac- 
tions of nsp3 and 4 are required to induce ER mem- 
brane curvature, and the expression of both M and E 
is required for the formation of smooth, spherical CoV 
VLPs, it would be interesting to see if a heterotypic 
interaction between M and E could drive membrane 
curvature by a similar mechanism [176-178]. Alterna- 
tively, no research exists on the exact purpose of the 
N-terminus of E. Perhaps homotypic interactions me- 
diated by the N-termini of alternating E proteins could 
be responsible for inducing membrane curvature by a 
similar mechanism. It is also worth noting that the 
mutation of each of the cysteine residues located in 
the nsp4 luminal loop abrogated the ability of nsp4 to 
rearrange the ER membranes [177]. This is interesting 
because cysteine residues are substrates for the palmi- 
toylation of proteins associated with membranes [113]. 
Perhaps this corroborates the requirement of E palmi- 
toylation, not in assembly per se, but rather by anchor- 
ing E during the induction of membrane curvature. It 
is quite evident that although a lot of progress has 
been made in determining the role of E in assembly, 
much still remains unknown. The role of E has also 
been proposed to be merely catalytic by functioning to 
pinch off, or in the scission of, the viral particle from 
the ER membrane during the terminal phase of 
budding [63]. 
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Fig. 4 Model proposed by Hagemeijer, Monastyrska [177] for the induction of ER membrane curvature. The luminal loops of CoV nsp3 and 4 are 
required to initiate rearrangement of the ER membrane and produce the DMVs characteristically seen in CoV-infected cells 
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Assembly and budding: Scission 

The viral envelope is formed primarily during assembly 
and culminates when the virion buds from the host 
membrane, a process known as scission [179]. Broadly, 
enveloped viruses can accomplish membrane scission ei- 
ther by hijacking/exploiting the host cell’s scission ma- 
chinery or through the expression of their own scission 
proteins [179]. In the absence of scission machinery, the 
budding process begins but ultimately stops, and render 
budding virions attached to the membrane by a small 
membranous neck. This causes virions to have an un- 
characteristically elongated morphology sometimes re- 
ferred to as “beads-on-a-string” and is seen in viruses 
that lack the necessary machinery to release the budded 
virion [179-183]. This is clearly and elegantly demon- 
strated in the mutation of the matrix-2 (M2) protein, a 
viral protein responsible for the budding and scission of 
the influenza virus. Virions that have failed to undergo 
scission remain attached to the host cell membrane by a 
membranous neck. The budding process is reinitiated at 
the site where scission failed, and a new virion is formed. 
However, the new virion also remains attached to the 
membrane as well as the previous virion by a small 
membranous neck. The continuation of this cycle and 
repeated initiation of budding results in the formation of 
consecutive scission-defective virions that resemble 
beads on a string [180, 181]. The same morphology has 
been reported for the Moloney murine leukaemia virus 
upon deletion and mutation of p12 protein that func- 
tions in its assembly and release [182]. 

While some enveloped viruses, like influenza A virus, 
encode their own scission proteins, other viruses rely on 
the host cell’s endosomal sorting complex required for 
transport (ESCRT) to accomplish this [179]. This dem- 
onstrates a necessity for viral-host PPIs but given the 
shortage of information available on CoV E-host PPIs, it 
is nearly impossible to say whether E mediates scission 
in an ESCRT-dependent manner or not. It is, therefore, 
essential that host cell candidates capable of interacting 
with CoV E be identified as they could be potential 
therapeutic targets for CoV antivirals to stop CoV scis- 
sion. Conversely, the release of influenza virions is medi- 
ated by the M2 protein in an ESCRT-independent 
manner. The amphipathic helix located in the cytoplas- 
mic tail of the M2 protein is both required and sufficient 
for the detachment of vesicle buds in an in vitro model 
system [184]. Mutation of the hydrophobic region of the 
helix also significantly reduced viral release in vivo, con- 
firming the importance of the 17-amino-acid-helix in 
the release of the influenza virus in vivo as well. In the 
absence of the M2 protein, buds formed inside infected 
cells but failed to detach and such cells exhibited the 
beads-on-a-string morphology. This suggests that M2 
can serve as a substitute for ESCRT complexes during 


Page 11 of 22 


influenza virus budding and, more importantly, raises 
the possibility of functionally equivalent M2’s in other 
enveloped viruses. 

Interestingly, an amphipathic a-helix is predicted to be 
located in the TMD of CoV E and has even been con- 
firmed in some of the CoVs [72, 76, 77, 135, 136, 138, 140, 
159, 185, 186]. It appears that no attempts have been 
made to determine whether E of any of the CoVs is re- 
sponsible for the scission of CoV virions during budding. 
However, expression of E alone has been reported to 
produce and secrete vesicles from cells but no further re- 
search has been done to determine how this is possible 
[60, 143]. Mutational studies would certainly benefit from 
EM analysis to determine what effects TMD mutations of 
E would have on virion budding. Electron microscopy can 
clearly demonstrate the consequences of mutated scission 
proteins and can even prove useful to ascertain what 
effects complete gene deletion have on viral budding. 


Release: Viroporin 

While the accumulation of E at the ERGIC points largely 
to a role in assembly and budding, only a small portion 
is incorporated into the viral envelope, suggesting that E 
has additional functions centred around the ER and 
Golgi region [66, 92, 109, 159]. Viroporins are viral- 
encoded membrane pore-forming proteins that can 
modulate cellular ion channels and have been suggested 
to regulate and function in multiple stages of the viral 
life cycle, from viral entry to assembly and release, and 
even pathogenesis [184, 187-196]. Although viroporins 
are not essential to viral replication, their absence does 
weaken or attenuate the virus and diminishes its patho- 
genic effects [35, 197-200]. They tend to be small pro- 
teins (~ 60-120 amino acids) of a predominantly 
hydrophobic nature that oligomerise in the membranes 
of infected cells, forming hydrophilic pores. The hydro- 
phobic residues line the outside of the structure, ori- 
ented toward the phospholipids, while the inside of the 
pore is made up of the hydrophilic resides [140, 159, 
201-204]. Most viroporins share certain structural fea- 
tures such as an amphipathic a-helix in the hydrophobic 
domain (HD) along with a cluster of positively charged, 
basic amino acids (such as lysine or arginine) which an- 
chor the pore to the membrane through electrostatic in- 
teractions with the negatively charged phospholipids 
(Fig. 5) [187, 205-207]. 

Viroporins can transport different ions but appear to be 
largely selective for the positively charged ions hydrogen 
(H*), K*, Na*, and calcium (Ca’*) [209, 210]. Although 
preferentially selective for cations, viroporins can also 
transport anions. The preference simply appears to be for 
cations over anions [211-213]. It is, however, interesting 
to note that, at a neutral pH, the ion selectivity of the re- 
spiratory syncytial virus (RSV) small hydrophobic (SH) 
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Fig. 5 Illustration of a typical viroporin structure and motifs. The pore of the viroporin (brown) is created by the amphipathic a-helix and the 
viroporin is anchored to a lipid bilayer by terminal positively charged residues (lysine or arginine). Conformational changes in the structure 
regulate the flow ions through the viroporin by opening (left) and closing (right) the pore [208] 


protein can change from cationic to anionic [214]. This 
suggests that viroporins are sensitive to changes in the cel- 
lular environment, a property that could be of therapeutic 
value. After all, the influenza A virus M2 protein is pH- 
gated and activates upon acidification of the endosome 
following receptor-mediated endocytosis of the virus 
[215]. In the same study, Schnell and Chou [215] showed 
that the anti-viral drug rimantadine exerts its anti-viral 
property by stabilising the M2 viroporin in its closed con- 
formation and in doing so inhibits viral replication [209, 
216]. Similarly, the E protein of several CoVs possesses ion 
channel activity, though the only structural data of the 
CoV viroporin has been derived from SARS-CoV using 
synthetic peptides [75, 135, 136, 138, 217, 218]. 

Synthetic peptides of SARS-CoV E demonstrate that 
the TMD is responsible for its ion-conductive properties 
[135, 136, 138]. Computational predictions and spectro- 
scopic studies show that the SARS-CoV E TMD under- 
goes oligomerisation, characteristic of ion-channelling 
proteins, to form a stable pentamer [75, 135-137]. Viro- 
porin formation appears to be mediated by ionic interac- 
tions rather than disulphide bonds as mutation of the 
porcine reproductive and respiratory syndrome virus 
(PRRSV) E protein cysteine residues appears to be 
dispensable for oligomerisation [219]. Research into the 
mechanism of viroporin formation is hampered by the 
hydrophobic nature of the TMD and has thus far been 
limited largely to mutational studies and the use of ion 
channel inhibitors such as amantadine and hexamethy- 
lene amiloride. 


The CoV E viroporin is equally cation-selective when 
it comes to its ion-channelling properties, demonstrating 
a preference for the monovalent cations Na” and K* 
[217, 218]. Synthetic peptides of SARS-CoV E, that re- 
semble the CoV E viroporin, are able to transport Na’, 
kK", and chloride ions (Cl-) but are more selective of Na* 
over kK” and least selective of Cl” [217]. Synthetic pep- 
tides that correspond to E from HCoV-229E, MHV, and 
IBV exhibit a similar cation-selectivity for MHV and IBV 
E as for SARS-CoV E. However, it is interesting that al- 
though the E viroporin synthetic peptides of HCoV-229E 
were still cation-selective, it exhibits a slightly higher 
selectivity for KY than for Na” [218]. The SARS-CoV E 
synthetic peptide findings were corroborated using a 
full-length SARS-CoV E protein [76]. More recently, 
purified full-length MERS-CoV E has also demonstrated 
limited ion-channelling properties and would benefit 
from a more comprehensive characterisation to establish 
whether it has ion-channelling properties similar to that 
of the other CoVs [140]. 

It should be cautioned that the charge on the lipid 
head group of membranes used can modulate the ion- 
selectivity of the viroporin. Neutral lipids appear to neg- 
ate the selectivity of the viroporin as the channels 
formed did not seem to differentiate cations from an- 
ions. In contrast, negatively charged lipids were more 
cation-selective than neutral lipids, being more perme- 
able to cations [76]. This suggests that the lipid head 
group of the membranes in use should be taken into 
consideration when interpreting the results as it might 
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skew the results and inaccurate conclusions may be 
drawn. At times, the ion channels were only marginally 
more selective of cations, bringing into question the ion- 
selectivity of the CoV E viroporin for one cation over 
another. In fact, an ion channel is only considered ion- 
specific when its permeability is nearly exclusive to one 
ion while extremely low to others [220]. Synthetic peptides 
corresponding to the full-length SARS-CoV E viroporin 
have also recently been shown to be capable of transport- 
ing Ca** and was linked to the inflammatory response 
often observed in ARDS [221]. This is the only study so 
far to have shown that the E viroporin of any CoV is 
capable of Ca** transport. 

Recent efforts have been directed toward understand- 
ing how mutant CoV E viruses carrying ion channel- 
inactivating mutations revert to their original pathogenic 
state. Mutants of SARS-CoV E carrying mutations N15A 
and V25F in the TMD restored ion channel activity by 
incorporating compensatory mutations in both in vitro 
and in vivo systems [77]. Mutant N15A reverted by in- 
corporating a single mutation that led to an amino acid 
change at the same position (A15D), creating a more 
stable mutant. Conversely, mutant V25F reverted to mu- 
tants with amino acid substitutions at either the same 
position (F25D) or positions relatively close to the ori- 
ginal mutation (L19A, F20 L, F26 L, L278, T30I, L37R). 
Intriguingly, the V25F mutants appeared as early as 2 
days after mice were infected where revertant mutant 
T30I surpassed the growth of the original virus by day 
two. This suggests that while some of these mutations 
appear to merely restore the loss of ion channel activity, 
it is not entirely inconceivable that revertant viruses 
would acquire gain of function mutations that can ren- 
der it more virulent [77]. Similar results were recently 
reported for IBV E TMD residues analogous to N15A 
and V25F (T16A and A26F) [222]. It is interesting to 
note that in both cases SARS-CoV E and IBV E followed 
a similar trend in their reversion: mutations at N15A 
and T16A both reverted by substitution of a single 
residue, whereas mutations at V25F and A26F produced 
revertants by acquisition of multiple residues. 

Some viroporins have been implicated in the release of 
viruses, but it is not yet known whether the release is 
mediated by the ion channel activity of the proteins 
[187, 223-226]. An intriguing study recently reported 
that both IBV infected and IBV E transfected cells exhib- 
ited a marked increase in the pH of the Golgi lumen 
[227]. These findings suggest that the IBV E viroporin 
could channel H™ and possibly mediate viral release by 
its ion channel activity. However, this increase in pH 
was found only in cells expressing a monomeric form of 
IBV E and not the oligomeric form as required for viro- 
porin formation. The authors proposed that the change 
in pH could be attributed to an interaction between the 
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monomeric form of E and a host protein. Although pos- 
sible, only a very small number of host proteins have been 
shown to interact with CoV E. The monomeric and oligo- 
meric forms were produced by transfection of mutated 
IBV E A26 to F26 (E***") and T16 to Al6 (E''**), respect- 
ively. In an earlier study, the same authors demonstrated 
that these two forms were present in IBV E-infected cells 
but that the monomeric form was much less (~ 10%) in 
infected cells than in transfected cells (~ 50%). The oligo- 
meric form, however, was the dominant form in infected 
cells [90]. This suggests that other viral proteins might 
affect or modulate the oligomerisation of IBV E. It is inter- 
esting to note that the M2 protein amphipathic helix motif 
was required for release of influenza A virus (IAV) parti- 
cles, perhaps indicating that this motif might be required 
for the processes budding, scission, and for viroporin 
activity [181]. It might be worth investigating whether 
ion-channel inhibitors, such as amantadine, or proton 
pump inhibitors specifically are able to inhibit this 
increase in Golgi pH. For now, though, it still remains 
to be seen whether CoV release is mediated by viro- 
porin ion channel activity or through PPIs with host 
proteins of the secretory pathway. 


Pathogenesis: ER stress response/unfolded-protein response 
(UPR) and apoptosis 
The ER can sustain a high load of protein content with- 
out being overwhelmed [228]. However, when the ER’s 
capacity for folding and processing proteins is exceeded, 
unfolded or misfolded proteins rapidly accumulate in 
the lumen and the ER stress response, or unfolded- 
protein response (UPR), is activated. The various signal- 
ling pathways that make up the UPR collectively func- 
tion by enhancing the folding of proteins, chaperoning, 
and ER-assisted degradation (ERAD) [229]. If, however, 
the UPR is prolonged and irreversible, apoptosis will be 
initiated [230]. By increasing the protein content, fold- 
ing, and processing of the ER, viral infections can also 
trigger the UPR and this pathway can be used by the 
host cell as an antiviral response [231]. Very few studies 
have looked at the role of CoV E in the ER stress re- 
sponse and its ability to induce apoptosis. In cultured 
cell lines, overexpressed MHV E and epitope-tagged 
SARS-CoV E induces apoptosis [87, 232]. However, cells 
infected with rSARS-CoV and rSARS-CoVAE, a more 
biologically relevant system, demonstrated that SARS- 
CoV E may regulate the UPR as part of its pathogenesis 
[233]. Cells infected with SARS-CoVAE exhibit a stron- 
ger stress response compared to cells infected with the 
wild-type virus. Moreover, a higher degree of apoptosis 
was observed in SARS-CoVAE-infected cells than in 
those infected with the wild-type virus. 

This study demonstrates the risk of interpreting data 
from overexpression and epitope-tagged studies. Results 
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generated by such studies might offer some insight into 
the putative functions of viral proteins but should be 
interpreted with great care as they can be misleading. 
Findings can only be more conclusive when supported 
by results from studies in more biologically relevant sys- 
tems. The study also shows that CoV E has an anti- 
apoptotic function in infected cells by suppressing the 
UPR during infection, likely as a survival mechanism 
and to continue viral propagation. This function of E 
has only been demonstrated in SARS-CoV so far, one of 
the most virulent HCoVs. It would be interesting to see 
whether E of the other CoVs, as well as the less virulent 
HCoVs, are also able to contribute to pathogenesis by 
regulating the host cell stress response. 


Immune response: Inflammasome activation 

Viruses often encode proteins that interfere with the im- 
mune system to either inhibit a response or enhance one 
as part of their pathogenicity. Some viral proteins disrupt 
components of the immune response pathways to disrupt 
the immune system and promote their viral evasion and 
pathogenesis [234—237]. Alternatively, viral proteins can 
modulate other cellular factors that could also disrupt the 
immune response to promote pathogenesis. Coxsackie- 
virus 2B protein promotes the internalisation of major 
histocompatibility complex class I (MHC-I) proteins and, 
in doing so, prevents their transport to the cell surface for 
immune recognition [238]. This protein also delays the 
transport of proteins along the secretory pathway by alter- 
ing the Ca** and H* concentrations of the Golgi and ER 
compartments and has been proposed to be a mechanism 
of immune evasion as well [239]. Influenza virus M2 pro- 
tein triggers activation of the NOD-like receptor family, 
pyrin domain containing 3 (NLRP3) inflammasome by 
creating ionic imbalances through its ion-channel activity 
[240]. Other viruses use viroporins to stimulate an im- 
mune response as part of their pathogenicity, including 
the E protein of PRRSV [241-243]. 

Inflammasome activation by CoV E was first reported 
in PRRSV [242]. Blocking ion channel activity with 
amantadine significantly inhibited activation of the 
inflammasome, demonstrating an association between E 
viroporin activity and inflammation. Recently, the trans- 
port of Ca’* by SARS-CoV E was shown to trigger 
inflammasome activation [221]. This establishes the link 
between inflammasome induction by SARS-CoV E and 
the inflammatory-mediated lung damage seen in SARS- 
CoV-infected mice [77]. Interestingly, despite attempts 
to inhibit ion channel activity in SARS-CoV E, by mutat- 
ing N15A and V25F, viruses restored ion channel activity 
by incorporating additional mutations after several pas- 
sages. The authors concluded that this ion-channelling 
function confers a selective advantage to the virus [77]. 
The reduction of inflammatory cytokines in the absence 
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of CoV E ion channel activity suggests that inhibition of 
the CoV E viroporin limits CoV pathogenicity and could 
be of therapeutic value to CoV infections. 


Future perspectives and conclusion 

While most CoV infections, such as those caused by 
HCoV-229E, HCoV-OC43, HCoV-NL63, and HCoV- 
HKU1, are mild and self-limiting, SARS-CoV and MERS- 
CoV cause severe infections that lead to high mortality 
rates [244—246]. There are currently no effective, licensed 
therapies for HCoV infections and existing treatment 
strategies are generally limited to symptomatic treatment 
and supportive care [26-—28, 247]. While an extensive 
amount of research has gone into identifying potential 
treatment options, most have only shown promise in vitro 
and will likely not progress further as they often have one 
or more limitations. Anti-viral candidates either exhibit 
only a narrow spectrum of activity, are only effective at 
unusually high therapeutic dosages or cause serious side 
effects or immune suppression [248]. A few studies have 
investigated the potential of rCoVs with a mutated E or 
lacking E, specifically focussing on SARS- and MERS- 
CoV, as live attenuated vaccine candidates with some 
promising results [34, 36, 165, 249, 250]. Vaccinated ani- 
mal models developed robust immune responses, both 
cellular and humoral, and were protected against infective 
challenges. This shows that CoV vaccines with mutated or 
deficient in E can potentially be used for prophylactic 
treatment, but the duration of immunity does not seem to 
have been established yet. 

Viruses exploit the extensive network of their host 
cell’s signalling pathways to promote viral replication 
and propagation [251, 252]. This dependence on PPIs of- 
fers the unique opportunity to target both viral-host and 
intraviral PPIs and, thereby, stop viral replication and 
propagation. Therapies that use small-molecule drugs 
have the advantage of small size, which allows the drugs 
to cross cell membranes efficiently, but it also severely 
limits the selectivity and targeting capabilities of the 
drug, which often leads to undesired side-effects [253]. 
Interactions between proteins take place over large, flat 
surface areas that feature shallow interaction sites. 
Small-molecule drugs, however, tend to bind to deep 
grooves or hydrophobic pockets not always found on the 
surface of target proteins, making it difficult for such 
drugs to disrupt PPIs (Fig. 6) [253-255]. Larger, protein- 
based therapies, on the other hand, make use of insulin, 
growth factors, and engineered antibodies, that form 
many more, and much stronger, interactions, making 
these therapies more potent and selective for their 
targets. Such properties result in fewer side-effects but 
the size of these agents also restricts their ability to cross 
the membranes of target cells [253]. This calls for thera- 
peutic agents that can bridge the gap between molecules 
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hGH:hGHR 


Fig. 6 Mechanisms of interaction between small molecules and proteins, and protein-protein interactions. Left: The binding of biotin to avidin 
occurs in a deep groove, while the interaction between the human growth hormone (hGH) and the NGH receptor (NGHR) occurs over a larger, 


flatter area [254] 


that are large enough to be specific and potent for their 
targets but still small enough to be able to cross target 
cell membranes efficiently and can also be manufac- 
tured easily. 

Stapled peptides fulfil these criteria to a large extent and 
have been applied to various human diseases and fields 
such as cancer, infections, metabolism, neurology, and 
endocrinology [256-260]. In fact, Aileron Therapeutics 
have already developed two stapled peptides, ALRN-5281 
and ATSP-7041. The company has already completed the 
first-in-human trail with ALRN-5281 for the treatment of 
rare endocrine diseases, such as adult growth hormone 
deficiency. Moreover, ATSP-7041 was designed to target 
intracellular PPIs, specifically murine double minute 2 
(MDM2) and murine double minute X (MDMX) [261]. 
To the best of the author’s knowledge, only a few studies 
so far have investigated the potential of stapled peptides as 
antiviral agents, with promising results for both intracellu- 
lar and extracellular targets. The focus so far has only 
been on HIV-1, RSV, and HCV [260, 262—265]. 

Granted, the therapeutic application of stapled pep- 
tides, particularly regarding viral infections, is still rela- 
tively new, but their numerous advantages give them 


tremendous potential as antiviral agents. Stapled peptides 
(1) can inhibit PPIs; (2) are more specific for their targets 
than small-molecule drugs, which also decreases the risk 
of unwanted side-effects; (3) can target diseases that are 
otherwise difficult to treat, referred to as “undruggable’; 
(4) can be modified easily to enhance membrane perme- 
ability, potency, and half-life; (5) have a short market time 
[253, 266, 267]. As more viral PPIs for CoV E are identi- 
fied, the repertoire of stapled peptide targets also expands 
making it easier to limit viral replication, propagation, and 
even pathogenesis. Stapled peptides have the potential to 
be used as antiviral agents that can work effectively at 
multiple levels. 

Autophagy is a cellular process that recycles excess or 
damaged cellular material to maintain the energy levels of 
the cell and ensure its survival. The material is removed 
from the cytoplasm by forming enclosed DMVs known as 
autophagosomes and then fused with lysosomes to be 
degraded [268, 269]. Recent studies have increasingly 
pointed to the involvement of autophagy components in 
viral infections [270]. Some suggest that it might have an 
antiviral function by inhibiting viral replication [271-273]. 
Others reported inhibition or subversion of autophagy as 
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a defence mechanism to promote viral propagation [274— 
276]. Others still, notably RNA viruses, appear to exploit 
autophagy for the purpose of viral propagation [277, 278]. 
Regarding CoVs, replication of TGEV is negatively regu- 
lated by autophagy [279]. Interestingly, PRRSV activates 
autophagy machinery, possibly to enhance viral replication 
as certain components of autophagy are required for 
MHV replication [280, 281]. These studies suggest the 
possibility of CoVs exploiting autophagy for replicative 
purposes. It has even been proposed that the DMVs 
formed in CoV-infected cells might be the result of au- 
tophagy and derived from the rough ER [281]. Recently, 
an increase in cytosolic Ca**, presumably from the ER 
lumen, has been implicated in autophagy induction by 
protein 2B (P2B) of the foot and mouth disease virus 
(FMDV) [282]. The rotavirus non-structural protein 4 
(NSP4) reportedly induces autophagy by a similar mech- 
anism [283]. Considering these studies, along with the 
ability of SARS-CoV to channel Ca”, it is not inconceiv- 
able that CoV E viroporin could induce autophagy in 
CoV-infected cells by increasing cytosolic Ca**. However, 
experimental evidence would be required to support the 
possibility of such a mechanism in CoVs. 


The multifunctional role CoV E protein: A central role in 
assembly, release, and pathogenesis? 

From studies, it appears that some viral proteins do not 
have unique, definitive functions. Despite the deletion of 
some viral genes, the viral life cycle continues, suggesting 
that other viral genes can compensate for this loss. It was 
recently shown to be the case for the vaccinia virus [284]. 
This is also evident in the varied requirements of the E 
protein for different CoVs and the reason(s) for this is not 
understood. Trafficking and maturation of TGEV virions 
is arrested without E [40]. Virions of MHV AE are capable 
of producing viable, replicating progeny [39]. Deletion of 
E from SARS-CoV attenuates the virus whereas, in the 
case of MERS-CoV, virions are propagation deficient [35, 
165]. Certain CoV accessory proteins appear to be able to 
complement, or sometimes even compensate for, the ab- 
sence of E in processes such as assembly, release, and the 
pathogenesis of some CoVs [30]. It is particularly note- 
worthy that SARS-CoV encodes two accessory proteins, 
3a and 8a, that might exhibit relative compensatory func- 
tions in the absence of E [285, 286]. In terms of viral repli- 
cation in vivo and in vitro, 3a could partially compensate 
for the loss of E. Moreover, 3a also contains a PBM and 
might be able to compensate for the loss of E to an extent 
but utilises different signalling pathways [285]. Although 
the study demonstrated that even the accessory proteins 
demonstrate some measure of dispensability, the virus still 
encodes these additional proteins with overlapping func- 
tions. The dynamics between these proteins, however, are 
not quite clear yet and warrants further investigation. 
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What is clear, though, is that viroporin proteins, case in 
point IAV M2, can exhibit a multitude of different func- 
tions independent of their ion-channel properties [153, 
184]. The studies in this review have shown that CoV E 
could be involved in multiple aspects of the viral replica- 
tion cycle: from assembly and induction of membrane 
curvature to scission or budding and release to apoptosis, 
inflammation and even autophagy. Although a lot of pro- 
gress has been made on CoV E, there is still much to be 
discovered about this small, enigmatic protein. 
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